Abstract 

Characterization of natural allelic diversity and understanding the genetic structure and linkage disequilibrium (LD) pattern 
in wild germplasm accessions by large-scale genotyping of informative microsatellite and single nucleotide polymorphism 
(SNP) markers is requisite to facilitate chickpea genetic improvement. Large-scale validation and high-throughput 
genotyping of genome-wide physically mapped 478 genic and genomic microsatellite markers and 380 transcription factor 
gene-derived SNP markers using gel-based assay, fluorescent dye-labelled automated fragment analyser and matrix-assisted 
laser desorption ionization-time of flight (MALDI-TOF) mass array have been performed. The outcome revealed their high 
genotyping success rate (97.5%) and existence of a high level of natural allelic diversity among 94 wild and cultivated Cicer 
accessions. High intra- and inter-specific polymorphic potential and wider molecular diversity (11-94%) along with a 
broader genetic base (13-78%) specifically in the functional genic regions of wild accessions was assayed by mapped 
markers. It suggested their utility in monitoring introgression and transferring target trait-specific genomic (gene) regions 
from wild to cultivated gene pool for the genetic enhancement. Distinct species/gene pool-wise differentiation, admixed 
domestication pattern, and differential genome-wide recombination and LD estimates/decay observed in a six structured 
population of wild and cultivated accessions using mapped markers further signifies their usefulness in chickpea genetics, 
genomics and breeding. 
Introduction 

Chickpea [Cicer arietinum (L.)] is one of the most important 
food legume crops in the world and serves as an important source 
of protein with essential amino acids in human diet. To improve 
the productivity and sustainability of agriculture, the development 
of high-yielding durable stress tolerant (climate resilient) cultivars is 
the prime objective in current chickpea genomics and molecular 
breeding research. The systematic and extensive breeding efforts 
of chickpea have resulted in the development of a number of high- 
yielding stress tolerant cultivars through intra- and inter-specific 
hybridization of a few selected germplasm accessions. Therefore, 
the genetic base of modern chickpea cultivars is narrow which in 
turn limits the yield enhancement and increased stress tolerance. 
Furthermore, the earlier archaeological, phenotypic and molecular 
phylogeny studies inferred that chickpea domestication might have 
led through maximum adaptation-based selection pressure, 
followed by punctuation by a series of four sequential evolutionary 
bottlenecks during their evolutionary transition (~ 10,000 years) 
from wild C. reticulatum, which progressively narrows down the 
genetic base in cultivated gene pool [1-5]. Chickpea belonging to 
the genus Cicer includes many annual and perennial wild species 
that represent a valuable and potential source of diverse gene pool 
for improving yield and abiotic/biotic stress tolerance in cultivated 
chickpea [6-11]. The genetic improvement of chickpea thus 
necessitates the identification and exploitation of its genetically 
diverse wild relatives with beneficial traits to broaden the genetic 
base of modern cultivars through inter-specific hybridization. The 
significance of inter-specific hybridization by introducing the 
desirable target traits of agricultural importance specifically from 
wild species of primary gene pool to cultivated varieties for 
enhancing their seed and pod yield and stress tolerance has been 
demonstrated in chickpea [7,8,10,12]. However, producing these 
successful inter-specific hybrids relied upon combinations of 
accessions selected from each of the cultivated and wild species 
for cross-pollination and their level of genetic variability to 
ascertain suitable cross-compatibility. 

The genetic diversity and evolutionary relationships among 
different annual and perennial Cicer species have been studied 
based on agro-morphological traits, inter-specific hybridization, 
karyotyping, seed storage protein and isozyme profiling [13-24]. 
Additionally, the utility of DNA-based markers for unbiased 
estimation of molecular diversity and establishing precise phylo- 
genetic relationships among cultivated and wild Cicer species as 
compared to morphological, cytological and biochemical markers 
have been well understood [25-31]. The PCR-based random 
molecular markers like RAPD (random amplified polymorphic 
DNA) and AFLP (amplified fragment length polymorphism) have 
also been utilized to understand the genetic diversity pattern and 
phylogenetic relationships among annual and perennial Cicer 
species [25-31]. All these diversity studies have a general 
consensus regarding classification of annual and perennial Cicer 
species into three crossability groups comprising primary, 
secondary and tertiary gene pools with certain controversies on 
phylogenetic origination of C. echinospermum, C. bijugum and C. 
pinnatifidum from these gene pools. Moreover, these studies 
documented narrow genetic variation within cultivated and wild 
Cicer species due to the inclusion of limited number of accessions 
per species and use of less informative random marker systems. 

The desirable genetic attributes (co-dominant inheritance, 
reproducibility, bi-/multi-allelic nature and abundant genomic 
distribution) of sequence-based robust microsatellite/simple se- 
quence repeat (SSR) and single nucleotide polymorphism (SNP) 
markers have encouraged their utilization for many applications of 
chickpea genetics, genomics and breeding including cultivar 
identification, genetic diversity analysis and establishing phylogenetic 
relationships among cultivated and wild species [32-40]. 
However, all these studies have utilized a limited number of 
genomic and genic microsatellite and SNP markers (~200 
markers) with no prior information on their chromosomal 
distribution for assaying a smaller set of accessions (at most 20) 
specifically belonging to three different wild annual Cicer species 
(C. reticulatum, C. echinospermum and C. bijugum) of the primary 
gene pool. Therefore, such studies are not able to infer realistic 
estimate of genetic variability existing among the accessions within 
individual wild and cultivated Cicer species. Rather, they produced 
ambiguous molecular diversity and phylogenetic information with 
no clear differentiation among evolutionarily closely related wild 
Cicer accessions belonging mainly to the primary gene pool, which 
has higher cross-compatibility with cultivated species. An understanding 
of the extent of natural diversity and phylogenetic 
relationships among accessions belonging to each of the primary, 
secondary and tertiary gene pools is essential to develop novel 
breeding strategies for chickpea genetic improvement. Accordingly, 
a large number of genic and genomic microsatellite and SNP 
markers showing genome-wide physical distribution (on a bp-scale) 
as well as representing LD (linkage disequilibrium) and haplotype 
blocks on the eight chromosomes of chickpea are required for 
monitoring introgression and broadening the genetic base of 
chickpea cultivars through inter-specific hybridization. It could 
thus expedite the process of transferring novel genes/alleles of 
agronomic importance from wild to cultivated chickpea species for 
their genetic improvement. 

The recently released draft genome sequences of desi and kabuli 
chickpea cultivars [41,42] could help to select a large number of 
genome-wide microsatellite and SNP markers from pseudomole- 
cules and LD/haplotype blocks of eight chromosomes based on 
their physical positions (bp) for realistic estimation of molecular 
diversity and phylogenetic relationships among cultivated and wild 
Cicer accessions. Recently, thousands of SNPs discovered by RAD 
(restriction site-associated DNA) sequencing of four C. reticulatum 
and one C. echinospermum wild accessions distributed over eight 
chickpea chromosomes have also been utilized to understand their 
population genetic structure and diversity pattern as compared 
with cultivated C. arietinum accessions [42]. However, this study 
could not detect species-wise differentiation/groupings of cultivated 
and wild accessions possibly due to use of few selected (five) wild 
accessions for genotyping and diversity analysis. Moreover, no 
systematic effort has yet been made to understand the existing 
molecular diversity pattern, population genetic structure and 
phylogenetic relationships among a larger and diverse set of wild 
Cicer accessions based on functionally relevant genic regions of 
their genome that are more prone to phenotypic selection during 
chickpea domestication. Mining of novel allelic variants from wild 
accessions and their large-scale validation through high-through- 
put genotyping of suitable informative microsatellite and SNP 
markers is also imperative, which will facilitate identification of 
diverse wild germplasm with beneficial traits to broaden the 
genetic base of cultivars and enhancing the genetic potential of 
chickpea globally. 

Keeping all above in view, the present study was undertaken to 
validate and genotype genome-wide well distributed genomic and 
gene-derived (transcription factor) microsatellite and SNP markers 
that are physically mapped (bp) across eight chickpea chromo- 
somes in 94 wild and cultivated Cicer accessions using gel-based 
assay, fluorescent dye-labelled automated fragment analyzer and 
34-plex MALDI-TOF (matrix-assisted laser desorption ionization- 
time of flight) mass array. The genotyping information of validated 
markers was further utilized for understanding the functional 
molecular diversity pattern, population genetic structure and 
phylogenetics, and determining the genome-wide and population- 
specific LD (linkage disequilibrium) patterns specifically in wild 
Cicer accessions. 

Materials and Methods 

Plant resources and genomic DNA extraction 

A total of 94 different Cicer accessions belonging to one annual 
cultivated C. arietinum (12 accessions), five annual wild species, 
namely C. reticulatum (16), C. echinospermum (8), C. judaicum 
(22), C. bijugum (19) and C. pinnatifidum (16) and one perennial 
species C. microphyllum (Table 1, Table S1) were used for 
large-scale validation and genotyping of microsatellite and SNP 
markers. The genomic DNA was isolated from the young leaf 
samples of 94 accessions using QIAGEN DNeasy 96 Plant Kit 
(QIAGEN, USA) following the manufacturer's instructions. 
Despite the wide availability of wild chickpea accessions in different 
genebanks, poor viability, germination and regeneration efficiency 
largely limit their utilization. Hence, we were able to use 
only 82 accessions belonging to six annual and perennial wild 
chickpea species for understanding their diversity and domestica- 
tion patterns. 

Large-scale validation and genotyping of microsatellite 
markers 

A set of 343 microsatellite markers designed from gene 
sequences including transcription factors (TFs) of C. arietinum 
desi ICC 4958 [41] and C. arietinum kabuli CDC Frontier [42] 
and 153 genomic microsatellite markers were selected based on 
their genome-wide physical distribution on eight chromosomes. 
These designed markers were PCR amplified using genomic DNA 
of 94 accessions (Table 1, Table S1). The constituents and 
cycling conditions used for PCR amplification were described by 
Jhanwar et al. [43] and Agarwal et al. [44]. The amplified PCR 
fragments were resolved in 3.5% MetaPhor agarose gel and 
automated fragment analyzer. For performing automated fragment 
analysis, the fluorescent dye-labelled microsatellite markers 
were resolved in ABI3730xl DNA analyzer following the 
procedure of Kujur et al. [45]. The electropherogram-containing 
trace files generated for each microsatellite marker were analysed 
in GeneMapper V4.0 (Applied Biosystems, USA) with optimized 
parameters and automatic allele calling was performed. The actual 
allele size (bp) and fragment length polymorphism detected by 
microsatellite markers in 94 accessions based on gel-based assay 
and automated fragment analysis were obtained. 

Discovery and high-throughput genotyping of TF gene- 
derived SNPs 

A set of 500 TF-encoding genes of ICC 4958 and CDC Frontier 
were selected based on their genome-wide physical localization 
(bp) on eight chickpea chromosomes. The forward and reverse 
primers targeting the functional domain and upstream regulatory 
regions of TF genes with amplification product size of 500 to 
700 bp were designed employing BatchPrimer3 
(http://probes.pw.usda.gov/batchprimer3). The synthesized 500 gene-based 
markers were amplified using the genomic DNA of ICC 4958 
and ICC 17160 and resolved in 1.5% agarose gel. The amplified 
PCR product of 479 markers showing clear, reproducible and 
monomorphic amplification between two chickpea accessions 
(ICC 4958 and ICC 17160) were purified and sequenced in both 
forward and reverse directions at least two times on ABI (Applied 
Biosystems) 3730 xl automated DNA sequencer using the target 
gene-based primers. The high-quality consensus sequences with 
high phred (≥30) and phrap (≥80) scores thus obtained were 
compared among two accessions and SNPs were detected in the 
TF genes employing the integrated software tool polyphred with a 
99% accuracy score. The SNPs detected through polyphred were 
further manually inspected and confirmed by aligning the high 
quality FASTA sequences between two chickpea accessions 
through CLUSTALW multiple alignment and graphical interface 
tools of BIOEDIT (http://www.mbio.ncsu.edu/BioEdit/bioedit.html). 
Based on these analyses, finally a set of high-quality 384 
SNP loci (one in each TF gene) differentiating ICC 4958 and ICC 
17160 was selected. 

These TF gene-derived 384 SNPs were further genotyped in the 
genomic DNA of 94 cultivated and wild Cicer accessions (Table 
S1) using Sequenom MALDI-TOF MassARRAY system 
(http://www.sequenom.com). Reproducibility of SNP genotyping assays 
was evaluated using seven samples (accessions) as biological 
replicates, one each representing the wild and cultivated Cicer 
species. The target SNP loci along with their flanking 100 bp 
sequences were curated for individual genes. The Sequenom 
MassARRAY multiplex iPLEX assay including pre-amplification 
primary forward and reverse PCR primers and one unextended 
primer (UEP) [act as single nucleotide extension primers (EP)] for 
each of the 384 SNP loci were designed using MassARRAY Assay 
Design software v3.1. All other default parameters except for the 
optimal amplicon size containing the SNP sites of 80-120 bp were 
set in the software for analysis. The EP and UEP were optimized 
for 34-plex assays and the oligos were procured from IDT, USA. A 10-mer 
tag (5'-ACGTTGGATG-3') was added to the 5'-end of each 
forward and reverse pre-amplification primers to improve the 
PCR performance and mass spectrometry. The amplification of 
synthesized pre-amplification primers with multiplex PCR assay, 
shrimp alkaline phosphatase (SAP) incubation for neutralizing 
unincorporated dNTPs, primer extension, resin cleanup and mass 
spectrometry were performed following the manufacturer's 
instructions of Sequenom iPLEX Gold amplification kit 
(http://www.sequenom.com). The SNP genotyping results obtained in 
iPLEX spectrochip bio-arrays were analyzed using MassARRAY 
Typer 3.4 software tool. The different SNP alleles among 94 
accessions were visualized and documented based on allele-specific 
differences in mass between extension products. 

Assessment of polymorphic potential, molecular diversity 
and population genetic structure 

The genotyping information of validated 478 microsatellite 
markers and 380 TF gene-derived SNP markers was analysed in 
PowerMarker v3.51 [46] to estimate the polymorphic allele 
number/locus, major allele frequency, per cent polymorphism, 
PIC (polymorphism information content), gene diversity [mea- 
sured as expected heterozygosity (He)] and heterozygosity. Intra- 
specific polymorphic potential of markers within individual species 
was estimated as proportion of total number of alleles amplified by 
microsatellite and SNP markers showing polymorphism among 
accessions belonging to each of seven wild and cultivated Cicer 
species [45,47]. The molecular basis of microsatellite marker 
fragment length polymorphism across wild and cultivated species 
based on expansion/contraction of repeat-units at their target 
locus and/or marker size homoplasy (as expected particularly in 
case of diverse inter-species comparison) was inferred. For this, the 
PCR products of 50 randomly selected genic and genomic 
microsatellite markers amplified from one accession representing 
each of seven different wild and cultivated species were cloned and 
sequenced twice in both forward and reverse direction through a 
capillary-based automated DNA sequencer and the derived high 
quality sequences were aligned and compared among accessions 
following the methods of Kujur et al. [45]. 
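
The diversity statistics named above (major allele frequency, gene diversity and PIC) can be illustrated with a minimal sketch. It uses the uncorrected textbook formulas, He = 1 - sum(p_i^2) and PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2; it is not the PowerMarker implementation, which may apply sample-size corrections, and the function names are hypothetical.

```python
from collections import Counter


def allele_frequencies(alleles):
    """Return {allele: frequency} for the allele calls observed at one locus."""
    counts = Counter(alleles)
    total = sum(counts.values())
    return {allele: count / total for allele, count in counts.items()}


def major_allele_frequency(freqs):
    """Frequency of the most common allele at the locus."""
    return max(freqs.values())


def gene_diversity(freqs):
    """Expected heterozygosity (uncorrected): He = 1 - sum(p_i^2)."""
    return 1.0 - sum(p * p for p in freqs.values())


def pic(freqs):
    """PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2 (uncorrected form)."""
    p = list(freqs.values())
    homozygosity = sum(x * x for x in p)
    cross_term = sum(
        2 * p[i] ** 2 * p[j] ** 2
        for i in range(len(p))
        for j in range(i + 1, len(p))
    )
    return 1.0 - homozygosity - cross_term


# Illustrative allele calls (fragment sizes in bp) for one made-up locus.
example = allele_frequencies([150, 150, 152, 154, 154, 154, 156, 150])
summary = (major_allele_frequency(example), gene_diversity(example), pic(example))
```

The example allele sizes are invented for illustration and do not come from the study.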

The genotyping data of 858 markers (including 478 microsatellite 
and 380 SNP markers) physically mapped across eight 
chromosomes of chickpea were used to determine functional 
molecular diversity and population structure and for establishing 
genetic relationships among 94 cultivated and wild Cicer 
accessions. The cluster analysis among accessions was performed 
based on Nei's genetic distance [48] employing the neighbour- 
joining (NJ) method (with 1,000 bootstrap replicates) in PowerMarker 
v3.51 and the unrooted phylogenetic tree was visualized using 
MEGA v4.0 [49]. To assess the correlation between 
microsatellite and SNP markers in their potential to detect 
molecular diversity, the genetic distance estimated for all pair-wise 
combinations of 94 accessions using these markers was plotted 
individually against each other and compared using the Mantel 
test [50]. 
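
The Mantel comparison of two distance matrices can be sketched as follows. This is a generic permutation version under stated assumptions (Pearson correlation of the upper triangles, a one-sided test, rows and columns of one matrix permuted together); the study does not state which software or settings were used for the test, and all names here are hypothetical.

```python
import random
from math import sqrt


def upper_triangle(matrix):
    """Flatten the strictly upper triangle of a square matrix."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def mantel(dist_a, dist_b, permutations=999, seed=0):
    """Return (observed r, one-sided permutation P-value) for two distance matrices."""
    rng = random.Random(seed)
    n = len(dist_a)
    flat_b = upper_triangle(dist_b)
    observed = pearson(upper_triangle(dist_a), flat_b)
    order = list(range(n))
    at_least_as_large = 0
    for _ in range(permutations):
        rng.shuffle(order)
        # Permute rows and columns of dist_a with the same ordering.
        permuted = [[dist_a[order[i]][order[j]] for j in range(n)] for i in range(n)]
        if pearson(upper_triangle(permuted), flat_b) >= observed:
            at_least_as_large += 1
    p_value = (at_least_as_large + 1) / (permutations + 1)
    return observed, p_value
```

In this setting `dist_a` and `dist_b` would be the microsatellite-based and SNP-based pair-wise distance matrices over the same ordered list of accessions.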

The genome-wide 858 marker genotyping data generated from 
94 wild and cultivated Cicer accessions were used in a model based 
program STRUCTURE v2.3 [51] employing the admixture and 
correlated allele frequency models with a burn-in of 100,000 iterations 
and run length of 100,000. Analysis of population genetic structure 
among accessions was carried out using the Bayesian clustering 
algorithm of STRUCTURE with varying levels of K (number of 
populations) = 1 to 10. We set the upper limit at 10 populations 
(K = 10) because our preliminary genetic distance-based phylogenetic 
tree analysis clustered the 94 accessions into six distinct 
population groups. The average of 
likelihood value, Ln P(D) against each K across 20 independent 
replications was estimated and plotted. The optimal value of K 
was determined according to ad hoc method of Pritchard et al. 
[51] and delta K procedure of Evanno et al. [52]. Using the 
optimum K, the population structure model representing better 
relationships among 94 cultivated and wild Cicer accessions was 
constructed. Various population genetic parameters including the 
efficiency of markers for estimating genetic divergence (FST) and 
degree of admixture among different population groups were 
estimated. Analysis of molecular variance (AMOVA) among 
accessions and populations, and within populations and accessions 
was performed using both GenALEX 6.4 [53] and PowerMarker 
[50]. 
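
The delta K procedure can be sketched from replicate Ln P(D) values alone. The version below assumes the commonly used form, delta K = |L(K+1) - 2 L(K) + L(K-1)| / sd[L(K)], computed from the mean Ln P(D) across replicate runs at each K; the function name and the input layout are hypothetical and this is not STRUCTURE's own output format.

```python
from statistics import mean, stdev


def delta_k(lnp_by_k):
    """Map each interior K to delta K.

    lnp_by_k: {K: [Ln P(D) of each replicate run at that K]}.
    K values without both neighbours (K-1 and K+1), or with zero
    spread across replicates, are skipped.
    """
    means = {k: mean(values) for k, values in lnp_by_k.items()}
    result = {}
    for k in sorted(lnp_by_k):
        if k - 1 not in means or k + 1 not in means:
            continue
        spread = stdev(lnp_by_k[k])
        if spread == 0:
            continue
        second_difference = abs(means[k + 1] - 2 * means[k] + means[k - 1])
        result[k] = second_difference / spread
    return result


def best_k(lnp_by_k):
    """K with the largest delta K."""
    scores = delta_k(lnp_by_k)
    return max(scores, key=scores.get)
```

Because delta K needs both neighbours, it is undefined at the smallest and largest K tested, which is one reason to run K beyond the number of groups expected.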

Estimation of LD patterns 

The correlation in frequency among pairs of alleles across pairs 
of the 858 marker loci that are physically mapped on eight 
chromosomes was analysed using TASSEL v2.1 [54] to 
determine genome-wide LD patterns as r² (average correlation 
coefficient) among 94 cultivated and wild Cicer accessions 
belonging to diverse population groups (as inferred by population 
structure). The significance (P-value) of LD estimates was 
determined at 100,000 permutations. The marker loci mapped 
on the same and different chromosomes were considered as linked 
and unlinked markers, respectively. The LD was estimated using 
the genotyping information of linked, unlinked and global (both 
linked and unlinked) markers. To determine the LD occurrence 
due to physical linkage of marker loci, the 99th percentile of the r² 
distribution for unlinked markers was considered as background 
level of LD [55]. The decay of LD with the physical distance was 
measured by combining the r² values of marker-pairs located in 
the physical intervals of 0-200, 200-400, 400-600, 600-800, 800-1000 
and 1000-1200 kb across eight chromosomes of chickpea. 
The graph was plotted between pooled r² and physical distance 
(kb) based on nonlinear regression model considering the r² value 
= 1 at marker physical distance of 0 kb [55,56] and finally the 
trend of LD decay was estimated in both cultivated and wild Cicer 
accessions. The statistical significance test comparing r² values of 
LD across population groups was performed using ANOVA 
(analysis of variance) interface tool of SPSS v17.0 
(http://www.spss.com/statsistics, IBM SPSS Inc. for Windows, Chicago, USA). 
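
The three steps described above (pair-wise r², a background threshold from unlinked pairs, and pooling by physical interval) can be sketched in simplified form. The sketch assumes biallelic loci coded 0/1 on phased haplotypes and a nearest-rank percentile; TASSEL's handling of multi-allelic microsatellites and unphased genotypes differs, and every name below is hypothetical.

```python
from math import ceil


def r_squared(locus_a, locus_b):
    """r^2 between two biallelic loci coded 0/1 over the same haplotypes."""
    n = len(locus_a)
    p_a = sum(locus_a) / n
    p_b = sum(locus_b) / n
    p_ab = sum(1 for a, b in zip(locus_a, locus_b) if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b
    denominator = p_a * (1 - p_a) * p_b * (1 - p_b)
    return d * d / denominator if denominator > 0 else float("nan")


def percentile(values, q):
    """Nearest-rank percentile (q between 0 and 100) of a non-empty list."""
    ordered = sorted(values)
    rank = max(1, ceil(q * len(ordered) / 100))
    return ordered[rank - 1]


def background_ld(unlinked_r2, q=99):
    """Background LD level: the q-th percentile of r^2 among unlinked pairs."""
    return percentile(unlinked_r2, q)


def pool_ld_by_distance(pairs, bin_kb=200, max_kb=1200):
    """Mean r^2 per physical interval.

    pairs: iterable of (distance_kb, r2) for linked marker pairs.
    Returns {(lower_kb, upper_kb): mean r2}; a pair at an interval
    boundary is assigned to the interval that starts there.
    """
    totals = {}
    for distance, r2 in pairs:
        if 0 <= distance < max_kb:
            lower = int(distance // bin_kb) * bin_kb
            key = (lower, lower + bin_kb)
            running_sum, count = totals.get(key, (0.0, 0))
            totals[key] = (running_sum + r2, count + 1)
    return {key: s / c for key, (s, c) in sorted(totals.items())}
```

Pooled values above the background level would be read as LD attributable to physical linkage; fitting the nonlinear decay curve itself is left out of this sketch.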

Results 

Large-scale validation and high-throughput genotyping 
of microsatellite and SNP markers 

A set of 496 microsatellite markers (including 343 genic and 153 genomic 
microsatellite markers) distributed (physically 
mapped) across eight chickpea chromosomes showing reproducible 
amplification were genotyped in 94 cultivated and wild Cicer 
accessions (Table 1) using the agarose gel-based assay and 
fluorescent dye-labelled automated fragment analyser (Figure 1). 
Four hundred seventy-eight (including 334 genic and 144 genomic 
microsatellite markers) of 496 microsatellite markers (96%) showed 
polymorphism among 94 Cicer accessions (Table S2). Moreover, 
we discovered 384 novel TF gene-derived high-quality SNPs 
(Table S3) showing differentiation between ICC 4958 and ICC 
17160 based on amplicon sequencing. These identified SNPs 
showing genome-wide physical distribution (bp) on eight chickpea 
chromosomes were genotyped on 94 cultivated and wild Cicer 
accessions using a maximum of 34-plex (Figure S1) high-throughput 
MALDI-TOF Sequenom mass array genotyping 
assay. The optimization and genotyping of these multiplex assays 
on 94 accessions validated 380 SNPs with high degree of 
reproducibility (100%) and genotyping success rate (98.9%). For 
each SNP locus, a call cluster plot was generated depicting the low 
and high mass homozygote and heterozygous yields (Figure 2). 
The genotypes of each accession were automatically 
called by comparing the masses of the peaks with the pre-determined 
masses estimated during pre-PCR amplification and 
single nucleotide extension primers designing. The multiplex assay 
of 384 SNPs was overall successful in discriminating homozygous 
and heterozygous alleles for 380 SNP loci (Figure 2), which 
produced 35720 genotype calls showing polymorphism among 94 
accessions. The remaining four SNPs with 376 genotype data points 
were identified either as monomorphic or without genotype calls. 
The functional annotation of experimentally validated informative 
334 microsatellite markers- and 380 SNP markers-containing 
genes revealed their maximum correspondence with TF gene 
families like bZIP (basic leucine zipper), bHLH (basic helix-loop-helix) 
and AP2-EREBP (APETALA2 ethylene-responsive element 
binding proteins) (525, 73.5%), followed by growth and metabolism-related 
enzymes (170, 23.8%) and expressed proteins (19, 
2.7%) (Figure S2). 
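
The tallies in this paragraph follow from simple arithmetic on the marker and accession counts, which the short check below reproduces. It only restates numbers given in the text; the helper names are hypothetical.

```python
ACCESSIONS = 94


def percent(part, whole):
    """Percentage of `part` in `whole`, rounded to one decimal place."""
    return round(100 * part / whole, 1)


def genotype_calls(n_loci, n_accessions=ACCESSIONS):
    """Number of locus-by-accession genotype data points."""
    return n_loci * n_accessions


# SNP assay: 384 loci assayed, 380 validated.
validated_calls = genotype_calls(380)
dropped_calls = genotype_calls(384 - 380)

# Functional annotation: 334 genic microsatellite genes plus 380 SNP genes.
annotated_genes = 334 + 380
annotation_shares = {
    "TF gene families": percent(525, annotated_genes),
    "growth and metabolism-related enzymes": percent(170, annotated_genes),
    "expressed proteins": percent(19, annotated_genes),
}

print(validated_calls, dropped_calls, annotation_shares)
```

The three annotation classes sum to 714 genes (525 + 170 + 19), matching the 334 + 380 marker-containing genes.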

Intra-/Inter-specific polymorphic potential of 
microsatellite and SNP markers 

The large-scale validation and genotyping of 880 markers, comprising 
496 microsatellite and 384 SNP markers, identified 858 (97.5%) 
markers (including 478 microsatellite and 380 SNP markers) as 
informative showing polymorphism among 94 cultivated and wild 
Cicer accessions (Figure S1). These polymorphic markers 
amplified a total of 3703 alleles which varied from 2 to 13 with 
an average of 4.26 alleles per marker locus (Table S4). The PIC 
and gene diversity estimated for the informative microsatellite and 
SNP markers ranged from 0.12 to 0.98 and 0.32 to 0.99 with a 
mean of 0.56 and 0.60, respectively. Even though the extent of 
polymorphism detected by microsatellite (96.4%) and SNP 
(98.9%) markers among 94 accessions was comparable with each 
other, the microsatellite markers revealed higher allele 
numbers (2-13; mean: 6), PIC (0.18-0.98; mean: 0.75) and gene 
diversity (0.18-0.99; mean: 0.80) in contrast to that estimated 
using SNP markers (allele number: 1-2; mean: 1.9, PIC: 0.02-0.47; 
mean: 0.31, gene diversity: 0.22-0.60; mean: 0.35) (Table 
S4). The heterozygosity and major allele frequency among 
accessions detected by both microsatellite and SNP markers 
varied from 0.01 to 0.32 and 0.20 to 1.0 with an average of 0.02 
and 0.70, respectively (Table S4). Maximum heterozygosity and 




Figure 1. Microsatellite marker-based genotypic variation among wild and cultivated chickpea. Allelic variation detected among a 
representative set of cultivated and wild accessions belonging to seven Cicer species using normal unlabeled and fluorescent dye-labelled genic and 
genomic microsatellite markers by gel-based assay (A) and automated fragment analyzer (B), respectively. A maximum of 13 polymorphic alleles were 
amplified by markers among 94 accessions using the gel-based assay and automated fragment analyzer. The fragment sizes (bp) for all the alleles are 
indicated. M: 50 bp DNA size standard. 
doi:10.1371/journal.pone.0107484.g001 

Figure 2. SNP marker-based genotypic variation among wild and cultivated chickpea. Call cluster plots for two representative SNP loci (C/T) 
demonstrating the genotyping information of all 94 cultivated and wild Cicer accessions assayed with MALDI-TOF mass array. Distinct 
differentiation of homozygous and heterozygous SNPs based on mass differences of corresponding alleles are evident. 
doi:10.1371/journal.pone.0107484.g002 



major allele frequency among accessions were detected by SNP 
markers (0.04, 0.83) rather than by microsatellite markers (0.0007, 
0.51). Inter-specific polymorphism between cultivated and wild 
species (467, 94.2% polymorphism; allele numbers: 2-11 with 
mean: 5, mean PIC: 0.52 and mean gene diversity: 0.55) was 
higher in contrast to intra-specific polymorphism within cultivated 
(desi and kabuli) and wild species (158, 18% polymorphism; allele 
number: 1-4 with mean: 2.0, mean PIC: 0.25 and mean gene 
diversity: 0.26). Intra-specific polymorphic potential detected by 
markers was maximum in C. reticulatum (90.2% polymorphism, 
allele numbers: 2 to 8, PIC: 0.47 and gene diversity: 0.52), 
followed by C. judaicum (71.4%, 2 to 7, 0.42, 0.45) and minimum 
in C. echinospermum (49.3%, 2 to 5, 0.34, 0.36) (Table S4). 
Maximum heterozygosity of markers was observed in wild C. 
judaicum (0.035), while it was completely absent in the cultivated 
species. In spite of detecting low PIC, gene diversity and allele 
numbers by SNP markers as compared to microsatellite markers, 
the trend of intra- and inter-specific polymorphism observed using 
combined analysis of microsatellite and SNP markers among 94 
Cicer accessions remained similar to that of individual marker 
assays. The degree of polymorphism detected by markers among 
the species/accessions belonging to secondary gene pool (58.4% 
polymorphism, mean PIC: 0.42) was higher compared to primary 
gene pool (47.6%, 0.31). We classified the 94 cultivated and wild 
accessions under study into two groups as Fertile Crescent 
(Turkey, Syrian Arab Republic, Jordan, Lebanon and Israel) 
and Central Asia (India) based on their corresponding geographical 
origin. Remarkably, the markers revealed much higher 
polymorphic potential among the species/accessions originating 
from Fertile Crescent (79.5% polymorphism, mean PIC: 0.50) in 
contrast to those from Central Asia (35.8%, 0.32). 

The cloning and sequencing of size variant amplicons of a 
selected 50 microsatellite marker sets from the Cicer accessions 
revealed exact correspondence of fragment length polymorphism 
(bp) detected by markers among seven diverse wild and cultivated 
Cicer accessions with expansion and contraction of their repeats 
(Figure S3). However, substitutions of single nucleotides in the 
sequences flanking the microsatellite repeat-motifs across seven 
wild and cultivated Cicer accessions (Figure S3) were evident. 

Molecular diversity, population genetic structure, 
species/gene-pool-wise domestication pattern and 
phylogenetics in chickpea 

The pair-wise distance matrix among 94 wild and cultivated 
accessions belonging to seven different Cicer species utilizing the 
genotyping information of genome-wide well distributed (physically 
mapped on eight chromosomes) 858 markers (including 478 
genic and genomic microsatellite markers and 380 TF gene-based 
SNP markers) was estimated. The microsatellite marker-based 
genetic distance among 94 accessions varied from 0.11 to 0.94, 
while SNP marker-based genetic distance in these accessions 
ranged from 0.10 to 0.42. A strong correlation (R² = 0.77) between 
genetic distance estimated by two marker types (microsatellite and 
SNP markers) based on allele sharing for all pairs of accessions was 
evident (Figure S4). The use of both microsatellite and SNP 
markers revealed a broad range of genetic distance from 0.11 
(kabuli cv. ICCV96329 - kabuli cv. BG2024) to 0.94 (C. 
reticulatum cv. ICC 17160 - C. judaicum cv. ILWC283) with an 
average of 0.62. Remarkably, we observed a wider range of 
genetic distance (0.13 to 0.78 with a mean: 0.58) among 94 
accessions using genic microsatellite and SNP markers. The mean 
genetic distance between cultivated and wild species varied from 
0.44 (C. arietinum - C. microphyllum) to 0.70 (C. arietinum - C. 
judaicum), while between wild species it ranged from 0.54 (C. 
echinospermum - C. bijugum) to 0.68 (C. reticulatum - C. judaicum) 
(Table S5). Maximum average genetic distance was observed 
among the accessions within C. reticulatum (0.52), followed by C. 
judaicum (0.46) and minimum within C. bijugum (0.35). The 
cultivated desi and kabuli accessions belonging to C. arietinum had 
least genetic diversity as evident from their pair-wise genetic 
distance (0.26) (Table S5). The species/accessions included 
under secondary gene pool (mean genetic distance: 0.46) had 
higher molecular diversity compared to primary gene pool (0.37). 
A greater genetic diversity among the accessions/species originat- 
ed from Fertile Crescent (mean genetic distance: 0.47) in contrast 
to those from Central Asia (0.23) was evident. 

The genetic relationships among 94 Cicer accessions were 
depicted in a neighbour-joining unrooted phylogram (Figure 3). 
The microsatellite and SNP markers clearly differentiated all the 
accessions from each other and resulted in a definite grouping of 
six clusters (I-VI) according to their species and gene pools of 
origination with high bootstrap values (91-97). However, no 
distinct differentiation among accessions was observed by use of 
only TF gene-based SNP markers (Figure S5). Even though 
microsatellite markers could differentiate all 94 accessions from 
each other, they clustered such accessions randomly into only six 
different groups with much deviation from their species/gene 
pools of origination (Figure S6). Interestingly, we identified a 
minimum of two genomic and five genic microsatellite markers 
and two TF gene-derived SNP marker combinations as most 
informative (higher PIC 0.79 to 0.96 and allele numbers 8 to 13), 
which could discriminate all the 94 accessions representing seven 
different Cicer species from each other (Table S2). The cultivated 
species C. arietinum was highly divergent from the other five wild 
species grouped in separate clusters with 91% occurrences, but 
was closer to C. reticulatum, C. echinospermum and one 
accession of C. microphyllum of Indian origin. 

The population genetic structure among 94 cultivated and wild 
Cicer accessions was determined using genome-wide well distrib- 
uted (physically mapped on eight chickpea chromosomes) 858 
markers employing STRUCTURE. The admixture model-based 
STRUCTURE simulations were conducted at varying levels of 
possible population numbers (K = 1 to 10) with 20 replications. 
The average LnP(D) (log-likelihood) value increased continuously 
with the increase in K from 1 to 10; however, its most apparent 
inflection was obtained at one of the best replicates of K = 6 
(Figure S7A). The results of population numbers (K) were further 
confirmed using the second order statistics of STRUCTURE by 
ΔK estimation. A sharp peak with a maximum value of ΔK was 
obtained at K = 6 (Figure S7B), which confirms the classification 
of 94 accessions into six genetically distinct population groups 
(POP I-VI) with high resolution population structure (Figure 4). 
POP I contained 12 cultivated desi and kabuli accessions of C. 
arietinum and one accession of C. microphyllum. POP II, III, 
IV, V and VI included accessions from C. bijugum (19 accessions), 
C. echinospermum (8), C. pinnatifidum (16), C. reticulatum (16) 
and C. judaicum (22), respectively. The STRUCTURE-based six 
population group assignment of 94 accessions was further 
comparable to their clustering patterns, species and gene pools 
of origination and phylogenetic relationships (Figure 4) as 
obtained by the neighbour-joining tree analysis using pair-wise 
genetic distances. A strong correlation between allele sharing- 
derived genetic distances estimated individually by microsatellite 
and SNP markers among six population groups was clearly evident 
based on Mantel test (0.81). The estimation of polymorphic 
potential of microsatellite and SNP markers among six population 
groups revealed their maximum polymorphism in population 
group, POP V (allele numbers: 2-8 with mean: 3.3 and PIC: 0.05- 
0.86 with mean: 0.37), followed by POP VI and minimum in POP 
I (Table 2). The molecular genetic variation among and within 
six populations using 858 markers showed a wider level of 
significant quantitative genetic differentiation based on pair-wise 
genetic distance and FST estimation (Tables S5 and S6), 
suggesting the existence of broader population differentiation in 
94 Cicer accessions. The genetic distance varied from 0.31 
between POP II and POP III to 0.86 between POP I and POP 
VI. Similarly, the pair-wise FST (P<0.001) ranged from 0.23 
between POP II and POP III to 0.89 between POP I and POP VI 
(Table S6). Notably, the trend of significant genetic divergence 
detected by FST among six population groups was consistent with 
that of genetic distance (Table S6). The population differentiation 
based on AMOVA revealed that 7.5%, 4.5% and 2.6% (P<0.01) 
of total molecular variation in the six population groups was 
attributed to genetic differentiation among populations, within 
populations (among accessions) and within accessions, respectively. 
Based on the proportion of FST estimation, divergence between 
population groups (59%) was higher compared to that estimated 
among accessions within populations (41%). Among the five wild 
population groups, the divergence was maximum between POP V 
and POP VI (FST = 0.80) and minimum between POP II and POP 
III (0.23). Higher genetic differentiation between population 
groups than within populations based on AMOVA and FST 
in a self-pollinated crop species like chickpea is expected. 
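
The ΔK criterion used above to choose K = 6 can be illustrated with a short Python sketch. It assumes the commonly used formulation in which ΔK is the absolute second-order difference of the mean LnP(D) across neighbouring K values, divided by the standard deviation of LnP(D) among replicate runs at that K. The function names (`delta_k`, `best_k`) and the input layout are hypothetical and are not taken from STRUCTURE or from this study.

```python
from statistics import mean, stdev


def delta_k(lnp_by_k):
    """Return {K: delta-K} for every K that has both neighbours K-1 and K+1.

    lnp_by_k maps each tested K to a list of LnP(D) values, one per
    replicate run (at least two replicates per K are required).
    """
    mean_l = {k: mean(values) for k, values in lnp_by_k.items()}
    sd_l = {k: stdev(values) for k, values in lnp_by_k.items()}
    result = {}
    for k in sorted(lnp_by_k):
        has_neighbours = (k - 1) in mean_l and (k + 1) in mean_l
        if has_neighbours and sd_l[k] > 0:
            # Absolute second-order rate of change of the mean log-likelihood.
            second_diff = abs(mean_l[k + 1] - 2 * mean_l[k] + mean_l[k - 1])
            result[k] = second_diff / sd_l[k]
    return result


def best_k(lnp_by_k):
    """Return the K with the largest delta-K value."""
    values = delta_k(lnp_by_k)
    return max(values, key=values.get)
```

Because ΔK needs both neighbours, it is undefined for the smallest and largest K tested, so a run over K = 1 to 10 yields ΔK values only for K = 2 to 9.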

All the 94 cultivated and wild Cicer accessions used in this study 
clearly belonged to a structured population in which about 95% of 
their inferred ancestry was derived from one of the model-based 
populations and the remaining ~5% contained admixed ancestry 
(Table S1). Maximum admixed ancestry (~13%) was observed 
in POP V, followed by POP II (~10%) and minimum in POP I 
(~0.1%). POP V had the highest admixed ancestry because of 
their maximum allelic admixtures with POP III (—O.S'/o) and POP 
I (~3.7%) (Figure 3). Likewise, we observed maximum admixtures 
of POP IV (~9.1%) with POP II and thus its higher admixed 
ancestry in POP II is expected. 

Estimation of genome-wide and population-specific LD 
patterns in wild Cicer species 

The LD estimates (r²) and extent of LD decay using all possible 
pair-combinations of genome-wide well distributed (physically 
mapped) 858 markers on eight chickpea chromosomes were 
determined among 94 Cicer accessions belonging to six population 
groups. In the association panel and whole population groups, 
average r² of global markers varied from 0.19 in POP VI to 0.27 in 
POP III (mean: 0.22) (Figure 5A) and 17% of global marker-pairs 
exhibited significant LD (P<0.001) (Table S7), indicating the 
existence of a moderate LD level in the panel of cultivated and wild 
accessions. Remarkably, 43% and 14% of the linked and unlinked 
marker-pairs revealed significant LD (P<0.001) and the average r² of 
linked and unlinked markers was estimated as 0.30 and 0.20, 
respectively. However, the trend of LD estimates and significant 
LD (%) observed among six population groups using global 
markers remained similar to that of linked and unlinked marker-pairs. 
The determination of LD estimates and significant LD (%) 
using linked marker-pairs that are physically mapped on eight 
Figure 3. Microsatellite and SNP marker-based genetic diversity and phylogeny among wild and cultivated chickpea. Unrooted 
phylogenetic tree depicting the genetic relationships among 94 cultivated and wild accessions belonging to seven Cicer species based on Nei's 
genetic distance using 334 genic and 144 genomic microsatellite markers and 380 TF gene-derived SNP markers. Molecular classification clearly 
differentiated 94 accessions into six different clusters, which corresponds to their species and gene pools of origination. Percentages of confidence 
obtained in bootstrap analysis are indicated at the corresponding node for each cluster. 
doi:10.1371/journal.pone.0107484.g003 



chickpea chromosomes indicated maximum estimates of average 
r² on chromosome 7 (0.40) and significant LD on chromosome 4 
(99.4%) (Table S7). In contrast to unlinked markers, the linked 
markers detected higher LD estimates and proportion of 
significant LD in all six population groups (94 accessions) and 
also across eight chickpea chromosomes. Further, the LD decay of 
858 marker-pairs was determined across six population groups by 
pooling the r² estimates across eight chromosomes and plotting 
their average r² against the physical distance of equal intervals 
(100 kb) up to 1200 kb. The non-linear regression curve 
exhibited a decreasing trend of LD with increase in the 
physical distance (kb) in all the six populations (Figure 5B). 
Overall, all the population groups sustained a significant level of 
LD up to a physical distance of 600 kb. In population groups POP 
I, POP II and POP III, the LD did not decay below r² = 0.1 up to 
800 kb physical distance. However, the genetically more diverse 
population groups POP IV, POP V and POP VI showed faster LD 
decay (500-600 kb) than that of less diverse populations POP I, 
POP II and POP III (Figure 5B). 
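
The binning step behind an LD decay curve of this kind can be sketched in Python as follows. This is a minimal illustration only: it assumes biallelic loci coded 0/1 in fully homozygous accessions (so that genotypes can be treated as haplotypes), whereas the study also used multi-allelic microsatellites. The function names, the 200 kb default bin width and the r² = 0.1 threshold are assumptions chosen for the example, and no regression curve is fitted.

```python
from collections import defaultdict


def r_squared(locus_a, locus_b):
    """r2 between two biallelic loci coded 0/1, one value per accession."""
    n = len(locus_a)
    p_a = sum(locus_a) / n
    p_b = sum(locus_b) / n
    p_ab = sum(1 for a, b in zip(locus_a, locus_b) if a == 1 and b == 1) / n
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return 0.0  # a monomorphic locus carries no LD information
    d = p_ab - p_a * p_b  # coefficient of linkage disequilibrium
    return d * d / denom


def ld_decay(pairs, bin_kb=200, max_kb=1200):
    """Average r2 per physical-distance bin.

    pairs: iterable of (distance_kb, r2) for marker pairs on one chromosome.
    Returns (bin_start_kb, bin_end_kb, mean_r2, n_pairs) for non-empty bins.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for distance, r2 in pairs:
        if 0 <= distance < max_kb:
            index = int(distance // bin_kb)
            sums[index] += r2
            counts[index] += 1
    return [
        (i * bin_kb, (i + 1) * bin_kb, sums[i] / counts[i], counts[i])
        for i in sorted(counts)
    ]


def decay_distance(curve, threshold=0.1):
    """Start (kb) of the first bin whose mean r2 is below the threshold."""
    for start, _end, mean_r2, _n in curve:
        if mean_r2 < threshold:
            return start
    return None
```

Running `ld_decay` separately on the marker pairs of each population group gives one curve per group, which is the comparison shown in Figure 5B.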



Discussion 

Large-scale validation and high-throughput genotyping of 
sequence-based microsatellite and SNP markers and their 
utilization in realistic estimate of genetic diversity pattern, 
population genetic structure and phylogenetic relationships among 
wild and cultivated Cicer accessions is essential for enhancing the 
genetic potential of cultivated chickpea. Natural genetic variation 
scanned in the wild Cicer, which is expected to be much wider 
than cultivated species [10], can be transferred from the diverse 
gene pools of wild accessions to cultivated species to facilitate their 
genetic improvement through marker-assisted introgression breed- 
ing. To accomplish these, microsatellite markers distributed over 
eight linkage groups of chickpea based on their genetic positions 
(cM) have been utilized previously for determining the genetic 
structure and molecular diversity particularly in wild Cicer 
accessions [40]. However, the diversity analysis by selecting 
markers based on their genetic positions (cM) on population- 
specific genetic linkage maps usually suffers from poor genome 
saturation primarily because of low representation of such markers 
in the chromosomal regions with low recombinant frequency and 
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Figure 4. Microsatellite and SNP marker-based assessment of population genetic structure among wild and cultivated chickpea. 

Optimization of population structure with varying population number K = 5 to K = 7 and their inferred best possible population genetic structure for 
94 cultivated and wild Cicer accessions using 334 genic and 144 genomic microsatellite markers and 380 TF gene-derived SNP markers physically 
mapped across eight chromosomes. These mapped markers assigned 94 accessions into six populations that largely grouped according to their 
species and gene pools of origination. The accessions represented by vertical bars along the horizontal axis were classified into K colour segments 
based on their estimated membership fraction in each K cluster. Six diverse colours represent different population groups based on optimal 
population number K = 6. 
doi:10.1371/journal.pone.0107484.g004 



their population-specific variation of recombination rates. To 
overcome this constraint, the marker sets showing genome-wide 
distribution based on physical positions (bp) rather than genetic 
positions (cM) on entire chromosomes have been utilized for large- 
scale genotyping applications including genetic diversity studies 
and genome mapping in crop plants [57]. The genome-wide non- 
uniform recombination rate is mostly dependent upon the 
magnitude and characteristics of accessions included under a 



diversity panel and their level of molecular diversity. It thus infers 
that the markers selection from constituted LD and haplotype 
blocks in contrast to physical map of chromosomes is more 
relevant and informative concerning the efficient marker-based 
genetic analysis in crop plants [58,59]. Recently, the implication of 
selecting markers from LD and haplotype blocks of chromosomes 
and their effective use in identification of untapped allelic variants 
and genetic combinations of complex traits in a larger natural 



Table 2. Polymorphic potential (represented by diverse statistical measures) detected in six model-based population groups 
(inferred by population genetic structure analysis) using 478 genic and genomic microsatellite markers and 380 TF gene-derived 
SNP markers. 

Population  Cicer species      Number of   Genotype      Allele        PIC (polymorphism     Gene diversity
groups                         accessions  number/locus  number/locus  information content)
POP I       C. arietinum       12          4.19          2-4 (2.2)     0.14-0.68 (0.15)      0.15-0.75 (0.16)
POP II      C. bijugum         19          4.3           2-6 (2.6)     0.05-0.78 (0.25)      0.05-0.83 (0.27)
POP III     C. echinospermum   8           5.06          2-5 (2.5)     0.11-0.77 (0.24)      0.12-0.82 (0.26)
POP IV      C. pinnatifidum    16          4.54          2-6 (2.6)     0.06-0.78 (0.26)      0.06-0.83 (0.29)
POP V       C. reticulatum     16          5.48          2-8 (3.3)     0.05-0.86 (0.37)      0.06-0.89 (0.42)
            C. microphyllum    1
POP VI      C. judaicum        22          4.5           2-7 (3.0)     0.04-0.82 (0.32)      0.05-0.87 (0.35)
Total                          94          4.45          2-13 (4.3)    0.02-0.98 (0.46)      0.22-0.99 (0.50)

doi:10.1371/journal.pone.0107484.t002 
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Figure 5. Microsatellite and SNP marker-based estimation of genome-wide and population-specific LD patterns in wild chickpea. 

(A) Estimates of LD (mean r²) for linked, unlinked and global markers (478 genic and genomic microsatellite markers and 380 TF gene-based SNP 
markers). The bar indicates the standard error. *ANOVA significance at p<0.01. (B) LD decay (mean r²) in six population groups as defined by 
population genetic structure. For LD decay, the r² value of marker physical distance of 0 kb is considered as 1. The marked dots indicate the mean r² 
values for marker intervals of 0-200, 200-400, 400-600, 600-800, 800-1000 and 1000-1200 kb, respectively. The curve was drawn across the dots 
using non-linear regression model. "All" includes the LD estimates and decay across entire six population groups. 
doi:10.1371/journal.pone.0107484.g005 



population along with capturing a wide spectrum of allelic 
diversity at whole genome-scale have been well demonstrated in 
HapMap projects of many crop plants, including maize, rice, 
soybean and Medicago (http://www.medicagohapmap.org) 
[58,60,61]. In this context, we utilized genome-wide well 
distributed 343 genic and 153 genomic microsatellite markers 
and 384 TF gene-based SNP markers that are physically mapped 
on eight chickpea chromosomes (possibly constituting LD/ 
haplotype blocks) for their large-scale validation and high- 
throughput genotyping in 94 cultivated and wild Cicer accessions 
(belonging to six annual and one perennial Cicer species) using gel- 
based assay, fluorescent dye-labelled automated fragment analyser 
and MALDI-TOF Sequenom mass array. 

The genotyping success rate and intra- and inter-specific 
polymorphic potential (96.4%) detected by genic and genomic 
microsatellite markers among 94 Cicer accessions is much higher 
than that estimated earlier in annual and perennial wild accessions 
(~40 accessions) using a smaller set of genomic microsatellite 
markers (30-40% [33,34], 63% [35], 71% [36], 88.9% [38], 
47.6% [39]). The genotyping success rate (98.9%) obtained using 
MALDI-TOF mass array is higher as compared to that estimated 
in chickpea using KASPar (competitive allele-specific PCR) (81%) 
[62] and Illumina GoldenGate (91%) [63] assays, but comparable 
to Illumina VeraCode-based BeadXpress assay (98%) [64]. Even 
though the MALDI-TOF assay showed comparable SNP geno- 
typing success rate, its major limitation of lower marker 
multiplexing potential (~40 SNP-plex) in contrast to other 
available high-throughput SNP genotyping assays like GoldenGate 
and KASPar (hundreds to millions of SNPs multiplexing) has 
impeded the usage of MALDI-TOF assay in large-scale validation 
and high-throughput genotyping of SNP loci in crop plants. For 
small-scale (about hundreds) validation and genotyping 
of SNPs in a limited number of accessions, however, the MALDI-TOF 
mass array has certain advantages over other high-throughput 
SNP genotyping assays in terms of cost and time 
required for its multiplex primer (UEP and EP) designing/ 
synthesis and assaying. These validated microsatellite and SNP 
markers being derived mostly from the diverse functionally 



relevant sequence components of genes (TFs) could be useful as 
functional markers for large-scale genotyping applications includ- 
ing identification of genes/QTLs for many qualitative and 
quantitative traits of agricultural importance specifically in wild 
chickpea. The utility of TF genes for regulating diverse growth and 
developmental processes and stress responses in plant species and 
legumes has been well studied [65-67]. 

The inter- and intra-specific polymorphism detected by 
validated 478 microsatellite and 380 SNP markers (total allele 
number: 3703, average 4.26 alleles/locus, PIC: 0.56 and gene 
diversity: 0.60) among 94 Cicer accessions is higher than that 
obtained earlier using genomic microsatellite markers (average 5 
alleles/locus with PIC: 0.26 [35], 3.3 alleles/locus and PIC 0.48 
[39], 6 alleles/locus with PIC: 0.68 [38], 5 alleles/locus with PIC: 
0.47 [45]) and genic microsatellite, intron-spanning and SNP 
markers (PIC: 0.34 [37]). Higher heterozygosity (0.02) among 82 
accessions belonging to six wild Cicer species than that of 
cultivated species (0.0) and positive correlation of both PIC and 
gene diversity with number of alleles/locus are in line with the 
findings of Upadhyaya et al. [40]. The cloning and sequencing of 
size variant amplicons of selected microsatellite marker sets from 
the Cicer accessions confirms that the allelic variation estimated in 
this study across diverse wild and cultivated species was due to 
variation in their microsatellite repeat-length, but not because of 
size homoplasy (insertions/deletions in the sequences flanking the 
repeat-motifs) of microsatellite markers as expected during inter-specific 
polymorphism [68,69]. A larger set of 478 genic and 
genomic microsatellite markers (showing no significant homoplasy 
at inter-species level) and 380 TF gene-based SNP markers 
covering the whole genome selected in this study through their 
validation and genotyping in 94 cultivated and wild Cicer 
accessions would thus have relevance for assessing the polymor- 
phic potential and understanding the diversity pattern, population 
genetic structure and phylogenetics in chickpea. 
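
The positive relationship between allele number, PIC and gene diversity noted above follows from how the two statistics are defined. The Python sketch below uses the standard textbook formulas (gene diversity as one minus the sum of squared allele frequencies, and PIC as that quantity minus a correction for pairs of alleles). It is an illustration under those assumptions, with hypothetical function names, and does not reproduce the software or any small-sample correction used in the study.

```python
from collections import Counter
from itertools import combinations


def allele_frequencies(alleles):
    """Relative frequency of each distinct allele observed at one locus."""
    counts = Counter(alleles)
    total = sum(counts.values())
    return [count / total for count in counts.values()]


def gene_diversity(freqs):
    """Expected heterozygosity: 1 - sum(p_i^2)."""
    return 1 - sum(p * p for p in freqs)


def pic(freqs):
    """Polymorphism information content.

    PIC = 1 - sum(p_i^2) - sum over allele pairs i<j of 2 * p_i^2 * p_j^2
    """
    homozygous = sum(p * p for p in freqs)
    cross = sum(2 * p * p * q * q for p, q in combinations(freqs, 2))
    return 1 - homozygous - cross
```

For a locus with two equally frequent alleles, gene diversity is 0.5 and PIC is 0.375; with four equally frequent alleles they rise to 0.75 and about 0.70, which is why loci with more alleles tend to score higher on both measures.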

A wide range (11 to 94%, mean: 62%) of genetic diversity 
observed in this study among 94 cultivated and wild Cicer 
accessions using genome-wide physically mapped 478 genic and 
genomic microsatellite markers and 380 TF-gene-based SNP 
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markers is higher than the molecular diversity level detected 
previously with the EST-derived (3 to 49% [68]) and genomic (37 
to 80% [34], 32 to 80% [36], 6 to 84% [39] and 34 to 88% [38]) 
microsatellite markers. In spite of high mutation rate of 
microsatellite markers in contrast to SNP markers across diverse 
species, the high linear tendency correlation (80%) between 
genetic distances detected by both these markers suggests no 
saturation of microsatellite markers at which the genetic distances 
are being measured. Therefore, the combined use of microsatellite 
and SNP markers in estimation of efficient and precise molecular 
diversity in 94 Cicer accessions is relevant. The genic microsatellite 
and SNP markers showing high inter-specific polymorphism 
(98.2%) and wider genetic diversity (13 to 78%, mean: 58%) 
specifically among wild and cultivated accessions could be utilized 
for precise identification of true inter-specific hybrids. Besides, 
such markers could also have utility in monitoring introgression 
breeding to assess the transfer of target genic regions 
controlling the traits of agricultural importance including abiotic 
and biotic stress tolerance from wild relatives to the backgrounds 
of cultivated gene pool for their genetic improvement. Henceforth, 
a wider genetic base and functional molecular diversity detected 
by genic markers in the expressed sequence component of genome 
among cultivated and wild accessions would be of significance in 
selection of useful diverse parental cultivated and wild accessions 
in cross-breeding program for the perspective of chickpea varietal 
improvement. Remarkably, seven microsatellite and two SNP 
markers were identified as most informative that could differen- 
tiate all 94 accessions representing seven cultivated and wild Cicer 
species from each other. Therefore, this smaller set of 
microsatellite and SNP markers selected by us might assay more 
relevant regions of chickpea genome for establishing distinctness 
among accessions. 

The present study has made use of an unrooted tree to decipher 
the phylogenetic relationships among 94 cultivated and wild 
chickpea accessions. The use of unrooted tree as a reliable 
approach for explaining evolutionary relationships among diverse 
crop genotypes has extensively been reported [60,61]. The closer 
correspondence of genetic relationships established among 94 cultivated and 
wild Cicer accessions using both microsatellite and SNP markers 
with their species and gene pools of origination, compared with that 
obtained using microsatellite or SNP markers individually, suggested that the 
functional genetic diversity assayed in our study by combining 
both kinds of sequence-based markers is realistic and thus would 
be useful in chickpea genomics and breeding when applied to a 
larger set of contrasting accessions. The converse genetic attributes 
of microsatellite and SNP markers specifically in terms of their 
high/low mutation rate, multi-/bi-allelic nature and abundance in 
genomic distribution might have complemented the level of 
polymorphic potential detected by each of these two marker 
systems in 94 accessions to produce a realistic 
estimate of molecular diversity in the chickpea genome. It is clearly 
evident from the high degree of genetic distance-based linear 
correlation (~80%) between microsatellite and SNP markers, as 
has also been observed in previous studies of many crop plants 
including maize and rye [70,71]. 

The inclusion of accessions belonging to C. arietinum, C. 
reticulatum and C. echinospermum species into three different 
population groups (determined by population genetic structure) as 
members of primary gene pool and accessions representing from 
C. bijugum, C. pinnatifidum and C. judaicum species under 
secondary gene pool as three diverse populations was clearly 
evident. These observations are fairly comparable with earlier 
genetic diversity, population structure and phylogenetic relation- 
ship studies on the cultivated and wild Cicer accessions using 



morphological traits, inter-specific hybridization, karyotyping, 
seed storage protein and isozyme profiling [13-24,72] as well as 
random and sequence-based molecular markers [25-34,36- 
40,42,73,74], but with some remarkable deviations. The mean 
FST among three wild species accessions of primary gene pool was 
estimated to be 0.55, whereas it was 0.60 between primary and 
secondary gene pools. Thus, the evaluated FST is in line with 
expected evolutionary relationships among the cultivated and wild 
species accessions belonging to primary and secondary gene pools. 
In most of these studies, no distinct differentiation/groupings 
among the accessions belonging to Cicer species (C. arietinum, C. 
reticulatum and C. echinospermum) of primary gene pools and also 
no collinear conclusive findings on origination of accessions 
representing C. bijugum and C. pinnatifidum either from primary 
or secondary gene pools were observed. In this context, our 
observation on distinct differentiation of all accessions included 
under six different annual/perennial Cicer species into six 
population groups based on their species/gene pools of origination 
is relevant. It could be due to combined use of genome-wide well 
distributed genic and genomic microsatellite and SNP markers 
that are physically mapped on eight chickpea chromosomes and 
thus might provide realistic estimate of diversity pattern and 
population structure among wild and cultivated Cicer accessions. 

The existence of about 5% admixed ancestry among six 
population groups suggests that all six cultivated and wild 
chickpea population groups diverged from a common ancestor. It 
further concurs with the earlier reports on origination, diversifi- 
cation and domestication of early Neolithic chickpea along with 
founder crops in the Fertile Crescent (eastern Mediterranean 
region) about 10,000-12,000 years ago [1,75-77]. Subsequently, 
complicated domestication pattern and complex breeding history 
involving inter-crossing/introgression coupled with different 
strong adaptive selection pressure among wild and cultivated 
chickpea accessions have led to diverse admixture frequency across 
six population groups. Maximum admixtures among the Cicer 
species belonging to POP I (C. arietinum), POP III (C. 
echinospermum) and POP V (C. reticulatum) as members of 
primary gene pools indicate that their evolutionary closeness could 
be due to their domestication at the archaeological sites of South 
Eastern Turkey about 10,000 years ago [1,72-74]. It is supported 
well with commonly accepted presumptions of previous molecular 
studies [13,29,30,33,34,36-39,40,42]. In spite of close phyloge- 
netic relationship of more diverse C. reticulatum accessions under 
POP V with accessions of C. echinospermum (POP III) and C. 
arietinum (POP I), we observed low genetic variability among 
cultivated desi and kabuli accessions of C. arietinum. It could be 
due to maximum adaptation-based selection pressure during the 
domestication of cultivated C. arietinum from wild C. reticulatum, 
followed by four sequential evolutionary bottlenecks, which in 
turn narrowed down the genetic base in cultivated chickpea [1,2,5]. 
A very limited eco-geographic distribution and narrow adaptation 
of wild progenitor accessions belonging to primary and secondary 
gene pools in south-eastern Turkey [78] and combined impacts of 
many factors associated with their domestication pattern, including 
founder effect in the ancient Neolithic era, separation of 
reproductive phases during early Bronze age, adaptation-based 
selection (against vernalization response) for a wide range of 
agro-ecological niches and modern breeding efforts might have caused 
low introgressive gene flow from wild to cultivated gene pools [3]. 
It ultimately resulted in a narrow genetic base in the cultivated desi and 
kabuli population as compared to other wild population groups. 
The estimation of higher genetic diversity particularly among the 
wild species/accessions originating from the Fertile Crescent than that 
of cultivated accessions from Central Asia supported these 
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findings. A maximum admixture (~9.1%) between POP II (C. 
bijugum) and POP IV (C. pinnatifidum) is expected as they both 
belong to secondary gene pools. Remarkably, population genetic 
structure of a perennial wild accession C. microphyllum included 
under POP V (C. reticulatum) contained more admixed ancestry 
from C. arietinum (POP I), C. reticulatum (POP V), C. 
echinospermum (POP III) of primary gene pools rather than C. 
pinnatifidum (POP IV) of secondary gene pools. The perennial C. 
microphyllum accession used in our study is of Indian origin and 
thus showed higher admixtures with Indian originated POP I (C. 
arietinum) and its two other evolutionarily closely related 
population groups (POP III and POP V) of primary gene pools. 
The diversity, population genetic structure, domestication and 
phylogenetics-related information generated in this study for 94 
wild and cultivated Cicer accessions have implications for many 
genotyping applications including association analysis and genet- 
ic/QTL mapping targeting different qualitative and quantitative 
traits of agricultural importance in chickpea. 

Higher LD estimates and proportion of significant LD obtained 
in all six population groups (comprising of 94 accessions) and also 
across eight chickpea chromosomes using the linked markers in 
contrast to unlinked markers reflected the direct correlation of LD 
patterning with physical linkage of markers on chromosomes and 
marker density required to cover the genomic regions. The 
extensive LD estimates and extended chromosomal LD decay 
(~500-600 kb) determined in the six population groups using the 
microsatellite and SNP markers were much higher than that 
reported previously in cross-pollinated [58,79-83] and non-domesticated 
[84,85] self-pollinated plant species, but comparable 
to domesticated selfing crop species like barley [86,87]. The 
extensive LD estimates and extended LD decay in the domesticated 
self-pollinated crop species like chickpea in contrast to other 
domesticated self-pollinated crops like rice and soybean could be 
due to extensive contribution of four sequential bottlenecks during 
the domestication of chickpea [1,3-5], which resulted in the 
reduction of genetic diversity in cultivated chickpea than that of 
other domesticated selfing crop plant species. Slower LD decay of 
more diverse population groups (POP II-VI) as compared to POP 
I representing the cultivated desi and kabuli accessions of C. 
arietinum is expected because of four sequential evolutionary 
bottlenecks occurring during their evolutionary divergence and 
domestication with C. reticulatum crop wild relative [4,5,77]. 
Remarkably, a varied LD decay was observed in six population 
groups even by combined use of higher number of genic and 
genomic microsatellite and SNP markers distributed across eight 
chromosomes. It suggests the effect of factors other than marker 
density, including genetic diversity and population genetic structure, 
on shaping the LD patterns [86] specifically in wild Cicer 
accessions. Overall, we inferred the population-specific LD 
patterns by considering the genome-wide recombination rates in 
94 cultivated and wild Cicer accessions. It gave us an insight about 
the emerging strategies that have been utilized in HapMap 
projects of many crop plants [58,60,61] for precise natural allelic 
diversity estimation as well as efficient trait association analysis and 
genetic/QTL mapping. 

Conclusions 

High genotyping success rate and intra- and inter-specific 
polymorphic potential (97.5%) was detected by large-scale 
validation of genome-wide physically mapped 478 genic and 
genomic microsatellite markers and 380 TF gene-derived SNP 
markers in 94 annual/perennial Cicer species using gel-based 
assay, fluorescent dye labelled automated fragment analyser and 



MALDI-TOF mass array. A wider genetic base (13-78%) and 
thus a higher natural allelic diversity specifically in the functionally 
relevant genic regions of wild chickpea was unraveled, suggesting 
their utility as potential resource for trait enhancement of 
cultivated gene pool. The informative 858 mapped markers thus 
have potential for selecting divergent cultivated and wild 
accessions and identifying true inter-specific hybrids for successful 
introgression breeding program, which will then facilitate the 
transfer of target genomic/gene regions controlling important 
agronomic traits from wild to cultivated chickpea for their genetic 
improvement. Wider genetic base, distinct species/gene pool-wise 
differentiation, admixed domestication pattern, differential ge- 
nome-wide recombination, extended LD decay (500-600 kb) and 
extensive LD estimates (0.19-0.27) in a six structured wild and 
cultivated population were observed. The molecular diversity and 
LD-based information generated in our study considering the 
genome-wide recombination rate in six diverse wild and cultivated 
population gave clues to select informative markers for precise 
allelic diversity estimation and marker-based large-scale genotyping 
applications in chickpea. It will further help us to identify 
natural targets for chickpea genomics-assisted breeding by deriving 
the causal impact of natural selection on adaptation of diverse wild 
and cultivated chickpea populations that are domesticated across 
various agro-climatic regions. 

Supporting information 

Figure S1 A representative 34-plex extension mass 
spectra for 94 cultivated and wild Cicer accessions 
obtained through MALDI-TOF mass array genotyping 
assay. The SNP IDs and their corresponding SNP loci are 
indicated on the top of each spectra. 
(PDF) 

Figure S2 Functional annotation of informative 334 
microsatellite and 380 SNP markers validated in the 
genes showed maximum correspondence to transcrip- 
tion factor gene families (73.3%), followed by genes 
controlling growth and metabolism enzymes (23.8%) 
and expressed proteins (2.7%). 
(PDF) 

Figure S3 The sequencing of cloned amplicons from a
microsatellite marker showing fragment length
polymorphism (156 and 162 bp) among cultivated and wild
accessions validated the presence of the expected repeat
motif (TTG)n and further corresponded with the
expansion/contraction [(TTG)9/(TTG)11] of the number of
repeat units.
(PDF) 

Figure S4 Correlation between genetic distances detect- 
ed by microsatellite and SNP markers among 94 
accessions belonging to seven wild and cultivated 
species. Each blue colored dot represents the genetic distance 
between a pair of accessions based on allele sharing of 
microsatellite (y-axis) and SNP markers (x-axis). 
(PDF) 

Figure S5 Unrooted phylogenetic tree depicting the 
genetic relationships among 94 cultivated and wild 
accessions belonging to seven Cicer species based on 
Nei's genetic distance using 380 TF gene-derived SNP 
markers. Molecular classification is not able to differentiate
accessions into six different clusters as expected based on their
species and gene pools of origin.
(PDF)

Figure S6 Unrooted phylogenetic tree depicting the
genetic relationships among 94 cultivated and wild
accessions belonging to seven Cicer species based on
Nei's genetic distance using 478 microsatellite markers
(including 334 genic and 144 genomic microsatellite
markers). Molecular classification clearly differentiated 94
accessions into six different clusters, which correspond to their
species and gene pools of origin, with some deviations.
(PDF) 

Figure S7 Optimization of the number of populations (K
value) varying from K = 1 to 10 to determine the best
possible population number for 94 cultivated and wild
Cicer accessions using the ad hoc procedure (A) of
STRUCTURE documented by Pritchard et al. (2000) and
second order statistics (delta K) (B) of Evanno et al.
(2005).
(PDF) 

Table S1 Wild and cultivated Cicer accessions used in
the study and their inferred ancestry coefficients in 
population genetic structure analysis. 

(PDF) 

Table S2 Summary of 496 microsatellite markers (343 genic
and 153 genomic) genotyped in 94 annual/perennial
cultivated and wild Cicer accessions using
gel-based assay and fluorescent dye-labelled automated
fragment analyser.
(PDF) 

Table S3 Summary of 384 TF gene-derived SNPs 
genotyped in 94 annual/perennial cultivated and wild 
Cicer accessions using high-throughput MALDI-TOF 
mass array. 

(PDF) 

Table S4 Polymorphic potential (represented by diverse
statistical measures) detected by 478 genic and
genomic microsatellite markers and 380 TF gene-derived
SNP markers among 94 cultivated and wild
Cicer accessions.
(PDF)

Table S5 Molecular diversity (represented by different
diversity statistical measures) detected by 478 genic and
genomic microsatellite markers and 380 TF gene-derived
SNP markers among 94 cultivated and wild
accessions within and between Cicer species.
(PDF)

Table S6 Pair-wise estimates of genetic divergence (FST)
and genetic distance among six model-based population
groups.
(PDF)

Table S7 LD estimates in six model-based individual
population groups and entire population as well as
across eight chromosomes.
(PDF)

Acknowledgments

We thank the DNA Sequencing Facility, NIPGR for automated fragment
analysis and sequencing. The authors also thank the Vice Chancellor,
CSKHPKV, Palampur, Himachal Pradesh, India for allowing use of the
off-season nursery facility for reliable phenotyping at its Regional Station,
Sangla.

Author Contributions

Conceived and designed the experiments: AKT SKP. Performed the
experiments: MSS DB. Analyzed the data: AKT SKP. Contributed to the
writing of the manuscript: MSS DB. Marker genotyping, sequencing and
diversity analysis: AK SD VK. Introduced and collected wild Cicer species
from the global gene bank and hot spots, helped in constitution of diversity
panel: MS KGB. Participated in drafting and correcting the manuscript
critically and gave the final approval of the version to be published: AKT
SKP.

References

1. Abbo S, Berger J, Turner NC (2003) Evolution of cultivated chickpea: four
bottlenecks limit diversity and constrain adaptation. Funct Plant Biol 30:
1081-1087.

2. Kumar J, Abbo S (2001) Genetics of flowering time in chickpea and its bearing
on productivity in semiarid environments. Adv Agron 72: 107-138.

3. Berger J, Abbo S, Turner NC (2003) Ecogeography of annual wild Cicer species:
the poor state of the world collection. Crop Sci 43: 1076-1090.

4. Berger JD, Buck R, Henzell JM, Turner NC (2005) Evolution in the genus Cicer:
vernalisation response and low temperature pod set in chickpea (C. arietinum L.)
and its annual wild relatives. Aust J Agric Res 56: 1191-1200.

5. Toker C (2009) A note on the evolution of kabuli chickpeas as shown by induced
mutations in Cicer reticulatum Ladizinsky. Genet Resour Crop Evol 56: 7-12.

6. Singh KB, Malhotra RS, Saxena MC (1995) Additional sources of tolerance to
cold in cultivated and wild Cicer species. Crop Sci 35: 1491-1497.

7. Kumar J, Van Rheenen HA (2000) A major gene for time of flowering in
chickpea. J Hered 91: 67-68.

8. Dwivedi SL, Upadhyaya HD, Balaji J, Buhariwalla HK, Blair MW, et al. (2005)
Using genomics to exploit grain legume biodiversity in crop improvement. Plant
Breed Rev 26: 171-357.

9. Shah TM, Hassan MU, Haq MA, Atta BM, Alam SS, et al. (2005) Evaluation of
Cicer species for resistance to Ascochyta blight. Pak J Bot 37: 431-438.

10. Singh R, Sharma P, Varshney RK, Sharma SK (2008) Chickpea improvement:
role of wild species and genetic markers. Biotechnol Genet Eng 25: 267-313.

11. Upadhyaya HD, Thudi M, Dronavalli N, Gujaria N, Singh S, et al. (2011)
Genomic tools and germplasm diversity for chickpea improvement. Plant Genet
Resour 9: 45-58. doi: 10.1007/s10528-012-9564-7

12. Collard BCY, Pang ECK, Ades PK, Taylor PWJ (2003) Preliminary
investigation of QTLs associated with seedling resistance to Ascochyta blight
from Cicer echinospermum: a wild relative of chickpea. Theor Appl Genet 107:
719-729.

13. Kazan K, Muehlbauer FJ (1991) Allozyme variation and phylogeny in annual
species of Cicer (Leguminosae). Plant Syst Evol 175: 11-21.

14. Ladizinsky G, Adler A (1976a) Genetic relationships among the annual species of
Cicer L. Theor Appl Genet 48: 197-203.

15. Ladizinsky G, Adler A (1976b) The origin of chickpea Cicer arietinum L.
Euphytica 25: 211-217. doi: 10.1007/BF00527371

16. Ladizinsky G, Adler A (1975) The origin of chickpea as indicated by seed protein
electrophoresis. Israel J Bot 24: 183-189.

17. Ahmad F (1989) The chromosomal architecture of Cicer anatolicum Alef: a wild 
perennial relative of chickpea. Cytologia 54: 753-757. 

18. Ocampo B, Venora G, Errico A, Singh KB, Saccardo F (1992) Karyotype
analysis in the genus Cicer. J Genet Breed 46: 229-240.

19. Ahmad F, Slinkard AE (1992) Genetic relationships in the genus Cicer L. as
revealed by polyacrylamide gel electrophoresis of seed storage proteins. Theor
Appl Genet 84: 688-692. doi: 10.1007/BF00224169

20. Ahmad F, Gaur PM, Slinkard AE (1992) Isozyme polymorphism and
phylogenetic interpretations in the genus Cicer L. Theor Appl Genet 83:
620-627. doi: 10.1007/BF00226907

21. Tayyar RI, Lukaszewski AJ, Waines JG (1994) Chromosome banding patterns in
the annual species Cicer. Genome 37: 656-663.

22. Labdi M, Robertson LD, Singh KB, Charrier A (1996) Genetic diversity and
phylogenetic relationships among the annual Cicer species as revealed by
isozyme polymorphism. Euphytica 88: 181-188.

23. Tayyar RI, Waines JG (1996) Genetic relationships among annual species of
Cicer (Fabaceae) using isozyme variation. Theor Appl Genet 92: 245-254. doi:
10.1007/BF00223381

24. Robertson LD, Ocampo B, Singh KB (1997) Morphological variation in wild
annual Cicer species in comparison to the cultigen. Euphytica 95: 309-319.

25. Ahmad F (1999) Random amplified polymorphic DNA (RAPD) analysis reveals 
genetic relationships among the annual Cicer species. Theor Appl Genet 98: 
657-663. 

26. Iruela M, Rubio J, Cubero JI, Gil J, Millan T (2002) Phylogenetic analysis in the
genus Cicer and cultivated chickpea using RAPD and ISSR markers. Theor
Appl Genet 104: 643-651.



PLOS ONE I www.plosone.org 



13 



September 2014 | Volume 9 | Issue 9 | e107484 



Allelic Diversity in Wild Chickpea 



27. Rajesh PN, Tullu A, Gil J, Gupta S, Ranjekar K, et al. (2002) Identification of an
STMS marker for the double-podding gene in chickpea. Theor Appl Genet 105:
604-607.

28. Chowdhury MA, Vandenberg V, Warkentin T (2002) Cultivar identification
and genetic relationship among selected breeding lines and cultivars in chickpea
(Cicer arietinum L.). Euphytica 127: 317-325.

29. Sudupak MA, Akkaya MS, Kence A (2002) Analysis of genetic relationships
among perennial and annual Cicer species growing in Turkey using RAPD
markers. Theor Appl Genet 105: 1220-1228.

30. Sudupak MA, Akkaya MS, Kence A (2004) Genetic relationships among
perennial and annual Cicer species growing in Turkey assessed by AFLP
fingerprinting. Theor Appl Genet 108: 937-944.

31. Shan F, Clarke HC, Plummer JA, Yan G, Siddique KH (2005) Geographical
patterns of genetic variation in the world collections of wild annual Cicer
characterized by amplified fragment length polymorphisms. Theor Appl Genet
110: 381-391.

32. Nguyen TT, Taylor PWJ, Redden RJ, Ford R (2004) Genetic diversity estimates
in Cicer using AFLP analysis. Plant Breed 123: 173-179.

33. Cingilli H, Altinkut A, Akcin A (2005) The use of microsatellite markers in the
annual and perennial Cicer species growing in Turkey. Biologia Sect Bot 60:
93-98.

34. Sethy NK, Choudhary S, Shokeen B, Bhatia S (2006) Identification of 
microsatellite markers from Cicer reticulatum: molecular variation and 
phylogenetic analysis. Theor Appl Genet 112: 347-357. 

35. Nayak SN, Zhu H, Varghese N, Datta S, Choi HK, et al. (2010) Integration of
novel SSR and gene-based SNP marker loci in the chickpea genetic map and
establishment of new anchor points with Medicago truncatula genome. Theor
Appl Genet 120: 1415-1441. doi: 10.1007/s00122-010-1265-1

36. Bharadwaj C, Srivastava R, Chauhan SK, Satyavathi CT, Kumar J, et al. (2011)
Molecular diversity and phylogeny in geographical collection of chickpea (Cicer
sp.) accessions. J Genet 90: e94-e100.

37. Gujaria N, Kumar J, Dauthal P, Dubey P, Hiremath P, et al. (2011)
Development and use of genic molecular markers (GMMs) for construction of
a transcript map of chickpea (Cicer arietinum L.). Theor Appl Genet 122:
1577-1589. doi: 10.1007/s00122-011-1556-1

38. Saeed A, Hovsepyan H, Darvishzadeh R, Imtiaz M, Panguluri SK, et al. (2011)
Genetic diversity of Iranian accessions, improved lines of chickpea (Cicer
arietinum L.) and their wild relatives by using simple sequence repeats. Plant
Mol Biol Rep 29: 848-858.

39. Choudhary P, Khanna SM, Jain PK, Bharadwaj C, Kumar J, et al. (2012)
Genetic structure and diversity analysis of the primary gene pool of chickpea
using SSR markers. Genet Mol Res 11: 891-905. doi: 10.4238/2012

40. Upadhyaya HD, Dwivedi SL, Baum M, Varshney RK, Udupa SM, et al. (2008)
Genetic structure, diversity, and allelic richness in composite collection and
reference set in chickpea (Cicer arietinum L.). BMC Plant Biol 8: 106. doi:
10.1186/1471-2229-8-106

41. Jain M, Misra G, Patel RK, Priya P, Jhanwar S, et al. (2013) A draft genome
sequence of the pulse crop chickpea (Cicer arietinum L.). Plant J 74: 715-729.
doi: 10.1111/tpj.12173

42. Varshney RK, Song C, Saxena RK, Azam S, Yu S, et al. (2013) Draft genome
sequence of chickpea (Cicer arietinum) provides a resource for trait
improvement. Nat Biotechnol 31: 240-246. doi: 10.1038/nbt.2491

43. Jhanwar S, Priya P, Garg R, Parida SK, Tyagi AK, et al. (2012) Transcriptome
sequencing of wild chickpea as a rich resource for marker development. Plant
Biotechnol J 10: 690-702. doi: 10.1111/j.1467-7652.2012.00712.x

44. Agarwal G, Jhanwar S, Priya P, Singh VK, Saxena MS, et al. (2012)
Comparative analysis of kabuli chickpea transcriptome with desi and wild
chickpea provides a rich resource for development of functional markers. PLoS
One 7: e52443. doi: 10.1371/journal.pone.0052443

45. Kujur A, Bajaj D, Saxena MS, Tripathi S, Upadhyaya HD, et al. (2013)
Functionally relevant microsatellite markers from chickpea transcription factor
genes for efficient genotyping applications and trait association mapping. DNA
Res 20: 355-374. doi: 10.1093/dnares/dst015

46. Liu K, Muse SV (2005) PowerMarker: an integrated analysis environment for
genetic marker analysis. Bioinformatics 21: 2128-2129.

47. Parida SK, Pandit A, Gaikwad K, Sharma TR, Srivastava PS, et al. (2010)
Functionally relevant microsatellites in sugarcane unigenes. BMC Plant Biol 10:
251. doi: 10.1186/1471-2229-10-251

48. Nei M, Tajima F, Tateno Y (1983) Accuracy of estimated phylogenetic trees 
from molecular data. J Mol Evol 19: 153-170. 

49. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, et al. (2011) MEGA5:
Molecular evolutionary genetics analysis using maximum likelihood,
evolutionary distance, and maximum parsimony methods. Mol Biol Evol 28: 2731-2739.
doi: 10.1093/molbev/msr121

50. Mantel N (1967) The detection of disease clustering and a generalized regression 
approach. Cancer Res 27: 209-220. 

51. Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure
using multilocus genotype data. Genetics 155: 945-959.

52. Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of 
individuals using the software STRUCTURE: a simulation study. Mol Ecol 14: 
2611-2620. 

53. Peakall R, Smouse PE (2006) GENALEX 6: genetic analysis in Excel.
Population genetic software for teaching and research. Mol Ecol Notes 6:
288-295.



54. Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, et al. (2007)
TASSEL: Software for association mapping of complex traits in diverse samples.
Bioinformatics 23: 2633-2635.

55. Mather KA, Caicedo AL, Polato NR, Olsen KM, McCouch S, et al. (2007)
Extent of linkage disequilibrium in rice (Oryza sativa L.). Genetics 177:
2223-2232.

56. Yan WG, Li Y, Agrama HA, Luo D, Gao G, et al. (2009) Association mapping
of stigma and spikelet characteristics in rice (Oryza sativa L.). Mol Breed 24:
277-292.

57. Orjuela J, Garavito A, Bouniol M, Arbelaez JD, Moreno L, et al. (2010) A
universal core genetic map for rice. Theor Appl Genet 120: 563-572. doi:
10.1007/s00122-009-1176-1

58. Gore MA, Chia JM, Elshire RJ, Sun Q, Ersoz ES, et al. (2009) A first-generation
haplotype map of maize. Science 326: 1115-1117. doi: 10.1126/science.1177837

59. Perez-de-Castro AM, Vilanova S, Canizares J, Pascual L, Blanca JM, et al.
(2012) Application of genomic tools in plant breeding. Curr Genomics 13:
179-195.

60. Zhao K, Tung CW, Eizenga GC, Wright MH, Ali ML, et al. (2011)
Genome-wide association mapping reveals a rich genetic architecture of complex traits in
Oryza sativa. Nat Commun 2: 467. doi: 10.1038/ncomms1467

61. Huang X, Zhao Y, Wei X, Li C, Wang A, et al. (2012) Genome-wide association
study of flowering time and grain yield traits in a worldwide collection of rice
germplasm. Nat Genet 44: 32-39. doi: 10.1038/ng.1018

62. Hiremath PJ, Kumar A, Penmetsa RV, Farmer A, Schlueter JA, et al. (2012)
Large-scale development of cost-effective SNP marker assays for diversity
assessment and genetic mapping in chickpea and comparative mapping in
legumes. Plant Biotechnol J 10: 716-732. doi:
10.1111/j.1467-7652.2012.00710.x

63. Gaur R, Azam S, Jeena G, Khan AW, Choudhary S, et al. (2012)
High-throughput SNP discovery and genotyping for constructing a saturated linkage
map of chickpea (Cicer arietinum L.). DNA Res 19: 357-373. doi:
10.1093/dnares/dss018

64. Roorkiwal M, Sawargaonkar SL, Chitikineni A, Thudi M, Saxena RK, et al.
(2013) Single nucleotide polymorphism genotyping for breeding and genetics
applications in chickpea and pigeonpea using the BeadXpress platform. Plant
Genome 6: 1-10.

65. Udvardi MK, Kakar K, Wandrey M, Montanari O, Murray J, et al. (2007)
Legume transcription factors: global regulators of plant development and
response to the environment. Plant Physiol 144: 538-549.

66. Libault M, Joshi T, Benedito VA, Xu D, Udvardi MK, et al. (2009) Legume
transcription factor genes: what makes legumes so special? Plant Physiol 151:
991-1001. doi: 10.1104/pp.109.144105

67. Agarwal P, Kapoor S, Tyagi AK (2011) Transcription factors regulating the 
progression of monocot and dicot seed development. Bioessays 33: 189-202. doi: 
10.1002/bies.201000107 

68. Choudhary S, Sethy NK, Shokeen B, Bhatia S (2009) Development of chickpea
EST-SSR markers and analysis of allelic variation across related species. Theor
Appl Genet 118: 591-608. doi: 10.1007/s00122-008-0923-z

69. Estoup A, Jarne P, Cornuet JM (2002) Homoplasy and mutation model at 
microsatellite loci and their consequences for population genetics analysis. Mol 
Ecol 11: 1591-1604. 

70. Hamblin MT, Warburton ML, Buckler ES (2007) Empirical comparison of
simple sequence repeats and single nucleotide polymorphisms in assessment of
maize diversity and relatedness. PLoS One 2: e1367.

71. Varshney RK, Beier U, Khlestkina EK, Kota R, Korzun V, et al. (2007) Single
nucleotide polymorphisms in rye (Secale cereale L.): discovery, frequency, and
applications for genome mapping and diversity studies. Theor Appl Genet 114:
1105-1116.

72. Robertson LD, Ocampo B, Singh KB (1997) Morphological variation in wild 
annual Cicer species in comparison to the cultigen. Euphytica 95: 309-319. 

73. Choumane W, Winter P, Weigand F, Kahl G (2000) Conservation and
variability of sequence-tagged microsatellites (STMSs) from chickpea (Cicer
aerietinum L.) within the genus Cicer. Theor Appl Genet 101: 269-278.

74. Hutokshi KB, Jayashree B, Eshwar K, Crouch JH (2005) Development of ESTs
from chickpea roots and their use in diversity analysis of the Cicer genus. BMC
Plant Biol 5: 16.

75. Zohary D, Hopf M (2000) Pulses. In: Domestication of plants in the old world.
Oxford: Clarendon Press. pp. 108-111.

76. Tanno K, Willcox G (2006) The origins of cultivation of Cicer arietinum L. and
Vicia faba L.: early finds from Tell el-Kerkh, north-west Syria, late 10th
millennium B.P. Veg Hist Archaeobot 15: 197-204.

77. Lev-Yadun S, Gopher A, Abbo S (1998) The cradle of agriculture. Science 288: 
1602-1603. 

78. Ladizinsky G (1998) Plant evolution under domestication. Kluwer Academic
Publishers: Dordrecht, NL.

79. Xiao Y, Cai D, Yang W, Ye W, Younas M, et al. (2012) Genetic structure and
linkage disequilibrium pattern of a rapeseed (Brassica napus L.) association
mapping panel revealed by microsatellites. Theor Appl Genet 125: 437-447.
doi: 10.1007/s00122-012-1843-5

80. Riedelsheimer C, Lisec J, Czedik-Eysenberg A, Sulpice R, Flis A, et al. (2012)
Genome-wide association mapping of leaf metabolic profiles for dissecting
complex traits in maize. Proc Natl Acad Sci USA 109: 8872-887. doi:
10.1073/pnas.1120813109






81. Sakiroglu M, Sherman-Broyles S, Story A, Moore KJ, Doyle JJ, et al. (2012)
Patterns of linkage disequilibrium and association mapping in diploid alfalfa (M.
sativa L.). Theor Appl Genet 125: 577-590. doi: 10.1007/s00122-012-1854-2

82. Hyten DL, Choi IY, Song Q, Shoemaker RC, Nelson RL, et al. (2007) Highly
variable patterns of linkage disequilibrium in multiple soybean populations.
Genetics 175: 1937-1944.

83. Lam HM, Xu X, Liu X, Chen W, Yang G, et al. (2010) Resequencing of 31 wild
and cultivated soybean genomes identifies patterns of genetic diversity and
selection. Nat Genet 42: 1053-1059. doi: 10.1038/ng.715

84. Atwell S, Huang YS, Vilhjalmsson BJ, Willems G, Horton M, et al. (2010)
Genome-wide association study of 107 phenotypes in Arabidopsis thaliana
inbred lines. Nature 465: 627-631. doi: 10.1038/nature08800

85. Branca A, Paape TD, Zhou P, Briskine R, Farmer AD, et al. (2011)
Whole-genome nucleotide diversity, recombination, and linkage disequilibrium in the
model legume Medicago truncatula. Proc Natl Acad Sci USA 108: E864-E870.
doi: 10.1073/pnas.1104032108

86. Haseneyer G, Stracke S, Piepho HP, Sauer S, Geiger HH, et al. (2010) DNA
polymorphisms and haplotype patterns of transcription factors involved in barley
endosperm development are associated with key agronomic traits. BMC Plant
Biol 10: 5. doi: 10.1186/1471-2229-10-5

87. Varshney RK, Paulo MJ, Grando S, Eeuwijk van FA, Keizer LCP, et al. (2012)
Genome wide association analyses for drought tolerance related traits in barley
(Hordeum vulgare L.). Field Crops Res 126: 171-180.






